• ConnNet: A Long-Range Relation-Aware Pixel-Connectivity Network for Salient Segmentation 

      Kampffmeyer, Michael C.; Dong, Nanqing; Liang, Xiaodan; Zhang, Yujia; Xing, Eric P. (Journal article; Tidsskriftartikkel; Peer reviewed, 2018-12-14)
      Salient segmentation aims to segment out attention-grabbing regions, a critical yet challenging task and the foundation of many high-level computer vision applications. It requires semantic-aware grouping of pixels into salient regions and benefits from the utilization of global multi-scale contexts to achieve good local reasoning. Previous works often address it as two-class segmentation problems ...
    • Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos 

      Li, Haoyuan; Dong, Haoye; Jia, Hanchao; Huang, Dong; Kampffmeyer, Michael Christian; Lin, Liang; Liang, Xiaodan (Journal article; Tidsskriftartikkel; Peer reviewed, 2024-01-15)
      Multi-person 3D mesh recovery from videos is a critical first step towards automatic perception of group behavior in virtual reality, physical therapy and beyond. However, existing approaches rely on multi-stage paradigms, where the person detection and tracking stages are performed in a multi-person setting, while temporal dynamics are only modeled for one person at a time. Consequently, their ...
    • DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment 

      Zhang, Xujie; Yang, Binbin; Kampffmeyer, Michael Christian; Zhang, Wenqing; Zhang, Shiyue; Lu, Guansong; Lin, Liang; Xu, Hang; Liang, Xiaodan (Journal article; Tidsskriftartikkel; Peer reviewed, 2024-01-15)
      Cross-modal garment synthesis and manipulation will significantly benefit the way fashion designers generate garments and modify their designs via flexible linguistic interfaces. However, despite the significant progress that has been made in generic image synthesis using diffusion models, producing garment images with garment part level semantics that are well aligned with input text prompts and ...
    • Dilated temporal relational adversarial network for generic video summarization 

      Zhang, Yujia; Kampffmeyer, Michael C.; Liang, Xiaodan; Zhang, Dingwen; Tan, Min; Xing, Eric P. (Journal article; Tidsskriftartikkel; Peer reviewed, 2019-10-12)
      The large amount of videos popping up every day, make it more and more critical that key information within videos can be extracted and understood in a very short time. Video summarization, the task of finding the smallest subset of frames, which still conveys the whole story of a given video, is thus of great significance to improve efficiency of video understanding. We propose a novel Dilated ...
    • M3D-VTON: A Monocular-to-3D Virtual Try-On Network 

      Zhao, Fuwei; Xie, Zhenyu; Kampffmeyer, Michael; Dong, Haoye; Han, Songfang; Zheng, Tianxiang; Zhang, Tao; Liang, Xiaodan (Journal article; Tidsskriftartikkel; Peer reviewed, 2022-02-28)
      Virtual 3D try-on can provide an intuitive and realistic view for online shopping and has a huge potential commercial value. However, existing 3D virtual try-on methods mainly rely on annotated 3D human shapes and garment templates, which hinders their applications in practical scenarios. 2D virtual try-on approaches provide a faster alternative to manipulate clothed humans, but lack the rich and ...
    • Reinforced Auto-Zoom Net: Towards Accurate and Fast Breast Cancer Segmentation in Whole-Slide Images 

      Dong, Nanqing; Kampffmeyer, Michael C.; Liang, Xiaodan; Wang, Zeya; Dai, Wei; Xing, Eric P. (Journal article; Tidsskriftartikkel; Peer reviewed, 2018-09-20)
      Convolutional neural networks have led to significant breakthroughs in the domain of medical image analysis. However, the task of breast cancer segmentation in whole-slide images (WSIs) is still underexplored. WSIs are large histopathological images with extremely high resolution. Constrained by the hardware and field of view, using high-magnification patches can slow down the inference process and ...
    • Rethinking knowledge graph propagation for zero-shot learning 

      Kampffmeyer, Michael C.; Chen, Yinbo; Liang, Xiaodan; Wang, Hao; Zhang, Yujia; Xing, Eric P. (Journal article; Tidsskriftartikkel; Peer reviewed, 2019)
      Graph convolutional neural networks have recently shown great potential for the task of zero-shot learning. These models are highly sample efficient as related concepts in the graph structure share statistical strength allowing generalization to new classes when faced with a lack of data. However, multi-layer architectures, which are required to propagate knowledge to distant nodes in the graph, ...
    • Towards robust partially supervised multi-structure medical image segmentation on small-scale data 

      Dong, Nanqing; Kampffmeyer, Michael; Liang, Xiaodan; Xu, Min; Voiculescu, Irina; Xing, Eric (Journal article; Tidsskriftartikkel; Peer reviewed, 2021-11-20)
      The data-driven nature of deep learning (DL) models for semantic segmentation requires a large number of pixel-level annotations. However, large-scale and fully labeled medical datasets are often unavailable for practical tasks. Recently, partially supervised methods have been proposed to utilize images with incomplete labels in the medical domain. To bridge the methodological gaps in ...
    • Towards Scalable Unpaired Virtual Try-On via Patch-Routed Spatially-Adaptive GAN 

      Xie, Zhenyu; Huang, Zaiyu; Zhao, Fuwei; Dong, Haoye; Kampffmeyer, Michael; Liang, Xiaodan (Journal article; Tidsskriftartikkel; Peer reviewed, 2021)
      Image-based virtual try-on is one of the most promising applications of human-centric image generation due to its tremendous real-world potential. Yet, as most try-on approaches fit in-shop garments onto a target person, they require the laborious and restrictive construction of a paired training dataset, severely limiting their scalability. While a few recent works attempt to transfer garments ...
    • Unsupervised domain adaptation for automatic estimation of cardiothoracic ratio 

      Dong, Nanqing; Kampffmeyer, Michael C.; Liang, Xiaodan; Wang, Zeya; Dai, Wei; Xing, Eric P. (Journal article; Tidsskriftartikkel; Peer reviewed, 2018-09-26)
      The cardiothoracic ratio (CTR), a clinical metric of heart size in chest X-rays (CXRs), is a key indicator of cardiomegaly. Manual measurement of CTR is time-consuming and can be affected by human subjectivity, making it desirable to design computer-aided systems that assist clinicians in the diagnosis process. Automatic CTR estimation through chest organ segmentation, however, requires large amounts ...